A Weighted Hybrid Thresholding Approach for Text Binarization
نویسندگان
چکیده
Text extraction in real images taken in unconstrained environments remains surprisingly challenging in Computer Vision due to language characteristics, complex backgrounds and the text color. Extraction of text and caption from images and videos is important and in great demand for video retrieval, annotation, indexing and content analysis. In this paper we propose a weighted hybrid thresholding approach. It is demonstrated that the proposed method achieved reasonable accuracy of the text extraction for moderately difficult examples.
منابع مشابه
Hybrid Binariztion Technique for Historical Manuscripts
This paper presents a new hybrid approach for the binarization and enhancement of Historical Manuscript. This paper deals with degradations which occur due to shadows, non-uniform illumination, low contrast and strain. We follow two distinct method of Binarization with a pre-processing procedure using a adaptive Wiener filter, a rough estimation of foreground regions and a background surface ca...
متن کاملText/ Background separation in the degraded document images by combining several thresholding techniques
Extract the text from the background is an important step in all process of document analysis and recognition. If this extraction is easy for document images of good quality by applying simple techniques of global thresholding, the images of degraded documents require a more accurate analysis and we have recourse in this case to local methods. Indeed, these latter are generally more efficient a...
متن کاملA Hybrid Binarization Technique for Document Images
In this chapter, a binarization technique specifically designed for historical document images is presented. Existing binarization techniques focus either on finding an appropriate global threshold or adapting a local threshold for each area in order to remove smear, strains, uneven illumination etc. Here, a hybrid approach is presented that first applies a global thresholding technique and, th...
متن کاملA Review on Global Binarization Algorithms for Degraded Document Images
Several algorithms have previously been proposed for improving the thresholding of degraded document images. No algorithm can solve all types of problems, but some algorithms are better than others for specific situations. This article reviews global binarization algorithms for improving degraded document images, thus indicating their differences and similarities, and also their advantages and ...
متن کاملCombining multiple thresholding binarization values to improve OCR output
For noisy, historical documents, a high optical character recognition (OCR) word error rate (WER) can render the OCR text unusable. Since image binarization is often the method used to identify foreground pixels, a significant body of research has sought to improve image-wide binarization directly. Instead of relying on any one imperfect binarization technique, our method incorporates informati...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012